A Web-Page Summarization for Just-in-Time Contextual Advertising

نویسندگان

  • ARIS ANAGNOSTOPOULOS
  • ANDREI Z. BRODER
  • VANJA JOSIFOVSKI
چکیده

Contextual advertising is a type of Web advertising, which, given the URL of a Web page, aims to embed into the page the most relevant textual ads available. For static pages that are displayed repeatedly, the matching of ads can be based on prior analysis of their entire content; however, often ads need to be matched to new or dynamically created pages that cannot be processed ahead of time. Analyzing the entire content of such pages on-the-fly entails prohibitive communication and latency costs. To solve the threehorned dilemma of either low-relevance or high-latency or high-load, we propose to use text-summarization techniques paired with external knowledge (exogenous to the page) to craft short page summaries in real time. Empirical evaluation proves that matching ads on the basis of such summaries does not sacrifice relevance, and is competitive with matching based on the entire page content. Specifically, we found that analyzing a carefully selected 6% fraction of the page text can sacrifice only 1%–3% in ad relevance. Furthermore, our summaries are fully compatible with the standard JavaScript mechanisms used for ad placement: they can be produced at ad-display time by simple additions to the usual script, and they only add 500–600 bytes to the usual request. We also compared our summarization approach, which is based on structural properties of the HTML content of the page, with a more principled one based on one of the standard text summarization tools (MEAD), and found their performance to be comparable.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using Snippets in Text Summarization: a Comparative Study and an Application

Automatic text summarization consists of automatically creating a summary of one or more texts. As for Web pages, unfortunately classical techniques cannot be applied in presence of dynamic contents. In this paper, we propose the adoption of snippets –i.e., page excerpts provided together with user query results by search engines– as a text summarization technique. The study is conducted along ...

متن کامل

Experimenting Text Summarization Techniques for Contextual Advertising

Contextual advertising systems suggest suitable advertisings to users while surfing the Web. Focusing on text summarization, we propose novel techniques for contextual advertising. Comparative experiments between these techniques and existing ones have been performed.

متن کامل

Semantic Associations for Contextual Advertising

Contextual advertising systems place ads automatically in Web pages, based on the Web page content. In this paper we present a machine learning approach to contextual advertising using a novel set of features which aims to capture subtle semantic associations between the vocabularies of the ad and the Web page. We design a model for ranking ads with respect to a page which is learned using Supp...

متن کامل

Development of Display Ads Retrieval System to Match Publisher's Contents

The technological transformation and automation of digital content delivery has revolutionized the media industry. Advertising landscape is gradually shifting its traditional media forms to the emergent of Internet advertising. In this paper, the types of internet advertising to be discussed on are contextual and sponsored search ads. These types of advertising have the central challenge of fin...

متن کامل

An ontology-based approach to Chinese semantic advertising

In the web advertising domain, contextual advertising and sponsored search are two of the main advertising channels used to display related advertisements on web pages. A major challenge for contextual advertising is to match advertisements and web pages based on their semantics. When a web page and its semantically related advertisements contain many different words, the performance of the tra...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008